Determining the minimum number of types necessary to represent the sizes of protein atoms
Abstract
MOTIVATION: Traditionally, for packing calculations, atoms have been grouped into a number of distinct 'types'. These often represent a heavy atom and its associated hydrogens (i.e. a united atom). Atom typing is usually done according to basic chemistry, giving rise to 20-30 protein atom types, such as carbonyl carbons, methyl groups, and hydroxyl groups. No one has yet investigated how similar in packing these chemically derived types are. Here we address this question in detail, using Voronoi volume calculations on a set of high-resolution crystal structures.

RESULTS: We perform a rigorous clustering analysis with cross-validation on tens of thousands of atom volumes and attempt to compile them into types based purely on packing. From our analysis, we are able to determine a 'minimal' set of 18 atom types that most efficiently represent the spectrum of packing in proteins. Furthermore, we are able to uncover a number of inconsistencies in traditional chemical typing schemes, where differently typed atoms have almost the same effective size. In particular, we find that tetrahedral carbons with two hydrogens are almost identical in size to many aromatic carbons with a single hydrogen.

AVAILABILITY: Programs available from http://geometry.molmovdb.org.

CONTACT: [email protected]; [email protected]; [email protected]

SUPPLEMENTARY INFORMATION: Available at http://geometry.molmovdb.org.
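The idea of compiling atoms into types based purely on packing can be illustrated with a small sketch. The abstract does not describe the clustering or cross-validation procedure, so the code below is not the authors' method: it swaps in a simple greedy merge of the two groups whose mean volumes are closest. The function name `merge_types_by_volume`, the type labels, and every number in `toy_volumes` are hypothetical placeholders, not measured Voronoi volumes.

```python
from statistics import mean


def merge_types_by_volume(volumes_by_type, n_types):
    """Greedily merge atom types with the closest mean volumes.

    Illustrative stand-in only; the abstract does not specify the
    clustering procedure actually used.
    """
    # Each cluster holds a tuple of member type names and their pooled volumes.
    clusters = [((name,), list(vols)) for name, vols in volumes_by_type.items()]
    while len(clusters) > n_types:
        # After sorting by mean, the closest pair of means is always adjacent.
        clusters.sort(key=lambda c: mean(c[1]))
        gaps = [
            mean(clusters[i + 1][1]) - mean(clusters[i][1])
            for i in range(len(clusters) - 1)
        ]
        i = gaps.index(min(gaps))
        merged = (clusters[i][0] + clusters[i + 1][0],
                  clusters[i][1] + clusters[i + 1][1])
        clusters[i:i + 2] = [merged]
    return [set(names) for names, _ in clusters]


# Hypothetical placeholder volumes, invented for this sketch.
toy_volumes = {
    "aliphatic_CH2": [23.0, 24.0, 23.5],
    "aromatic_CH": [23.2, 23.8, 24.1],
    "carbonyl_C": [9.0, 9.5, 10.0],
    "hydroxyl_OH": [17.0, 18.0, 17.5],
}

packing_types = merge_types_by_volume(toy_volumes, n_types=3)
print(packing_types)
```

With these placeholder values the two carbon types with near-identical means end up in one packing type, mirroring the kind of overlap the abstract reports. A fuller treatment would also compare the spread of each distribution and validate the number of types on held-out data.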
Journal: Bioinformatics
Volume 17, Issue 10
Pages: -
Publication date: 2001